Mixture Component Clustering for Efficient Speaker Verification
Authors
Abstract
In speaker verification (SV) systems based on a support vector machine (SVM) using Gaussian mixture model (GMM) supervectors, a large portion of the test-stage computational load is the calculation of the a posteriori probabilities of the feature vectors for the given universal background model (UBM). Furthermore, the calculation of the sufficient statistics for the mean contributes substantially to the computational load. In this paper, we propose several methods to cluster the GMM-UBM mixture components in order to reduce the computational load and speed up the verification. In the adaptation stage, we compare the feature vectors to the clusters, then calculate the a posteriori probabilities and update the statistics exclusively for mixture components belonging to appropriate clusters. Our results demonstrate that, on average, we can reduce the number of a posteriori probability calculations by a factor of up to 2.8 without loss in accuracy.
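The idea in the abstract, evaluating a posteriori probabilities only for mixture components in clusters that match the feature vector, can be illustrated with a minimal sketch. The abstract does not say how the clusters are built or how a feature vector is compared to them, so everything below is an assumption made for illustration: diagonal covariances, hand-picked clusters, Euclidean distance to the cluster centroid as the comparison, and all function names (`posteriors`, `shortlist`, and so on). It is not the authors' method.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def posteriors(x, weights, means, variances, active=None):
    """Posterior of each component given x, evaluating only `active` components.

    Components outside `active` are never evaluated and get posterior 0.
    """
    active = list(range(len(weights))) if active is None else list(active)
    log_p = [
        math.log(weights[k]) + log_gauss_diag(x, means[k], variances[k])
        for k in active
    ]
    top = max(log_p)  # subtract the maximum for numerical stability
    unnorm = [math.exp(lp - top) for lp in log_p]
    total = sum(unnorm)
    post = [0.0] * len(weights)
    for k, u in zip(active, unnorm):
        post[k] = u / total
    return post


def cluster_centroids(means, clusters):
    """Centroid of each cluster, taken as the average of its component means."""
    dim = len(means[0])
    return [
        [sum(means[k][d] for k in members) / len(members) for d in range(dim)]
        for members in clusters
    ]


def shortlist(x, centroids, clusters, n_clusters=1):
    """Components of the n_clusters clusters whose centroids lie closest to x."""
    order = sorted(
        range(len(centroids)),
        key=lambda c: sum((xi - ci) ** 2 for xi, ci in zip(x, centroids[c])),
    )
    return [k for c in order[:n_clusters] for k in clusters[c]]


# Toy 4-component "UBM" in 2 dimensions (illustrative values only).
weights = [0.25, 0.25, 0.25, 0.25]
means = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
variances = [[1.0, 1.0]] * 4
clusters = [[0, 1], [2, 3]]  # assumed, hand-picked grouping of components

centroids = cluster_centroids(means, clusters)
x = [0.2, 0.4]  # one feature vector

full = posteriors(x, weights, means, variances)          # 4 evaluations
active = shortlist(x, centroids, clusters)               # 2 centroid distances
fast = posteriors(x, weights, means, variances, active)  # 2 evaluations
max_diff = max(abs(a - b) for a, b in zip(full, fast))
```

In this toy setup the shortlist halves the number of Gaussian evaluations per feature vector, at the cost of one distance computation per cluster. How much is saved in practice, and whether accuracy is preserved, depends on the clustering and on how many clusters are kept per vector.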
Similar resources
Exploiting GMM-based Quality Measure for SVM Speaker Verification
In this paper, we examine the problem of quality measurement for speaker verification using support vector machines (SVMs). An efficient quality estimation algorithm based on Gaussian mixture models (GMMs) is proposed to potentially utilize speaker-specific broad acoustic-class characteristics. Some verification strategies are also considered in the test phase. We perform clustering-based vector p...
Compute Efficient Training Method for Gaussian Mixture Model Based Speaker Verification
Speaker verification is a memory- and compute-intensive process, which raises area and latency concerns for its System-on-a-Chip implementation. The training schemes for computing the speaker models contribute significantly to the overall complexity of implementing the system. In this paper, we demonstrate that the K-Means algorithm can be used to realize compute efficient train...
Efficient text-independent speaker verification with structural Gaussian mixture models and neural network
We present an integrated system with structural Gaussian mixture models (SGMMs) and a neural network for purposes of achieving both computational efficiency and high accuracy in text-independent speaker verification. A structural background model (SBM) is constructed first by hierarchically clustering all Gaussian mixture components in a universal background model (UBM). In this way the acousti...
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model
We have proposed a novel speaker clustering method based on a hierarchically structured utterance-oriented Dirichlet process mixture model. In the proposed method, the number of speakers can be determined from the given data in a nonparametric Bayesian manner, and intra-speaker variability is successfully handled by multi-scale mixture modeling. Experimental results showed that the proposed me...
Efficient Text-Independent Speaker Identification using Optimized Hierarchical Mixture Clustering
Conventional speaker identification (SI) systems use an individual Gaussian mixture model (GMM) for every speaker. If this method is used for large-population speaker identification systems, the likelihood computations between an unknown speaker's test feature vectors and the speaker models become time-consuming during identification. This approach also increases the computationa...
Journal title:
Volume, issue
Pages -
Publication date: 2012